ESD-TR-7  2-77 


ESD  ACCESSION  LIST 

TRl  Call  No._  _7Aa3l± - 

Copy  No. _ /_ 


of 


cys. 


ESD  RECORD  COPY 

RETURN  TO 

SCIENTIFIC  &  TECHNICAL  INFORMATION  DIVISION 


(TRl),  Building  1210 

Technical  Report 

494 

J.  W.  Modestino 

Optimum  Detection 
of  M-ary  Orthogonal  Signals 
in  ELF  Noise  Environments 

17  March  1972 

Prepared  for  the  Department  of  the  Navy 
under  Electronic  Systems  Division  Contract  F 196 28-7 O-C-O 230  by 

Lincoln  Laboratory 

MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY 

Lexingtony  Massachusetts 

* 

A07«3OC^ 


Approved  for  public  release;  distribution  unlimited. 


MASSACHUSETTS  INSTITUTE  OF  TECHNOLOGY 


LINCOLN  LABORATORY 

OPTIMUM  DETECTION  OF  M-ARY  ORTHOGONAL  SIGNALS 
IN  ELF  NOISE  ENVIRONMENTS 

].  W.  MODESTINO 
Consultant  to  Group  66 


TECHNICAL  REPORT  494 

17  MARCH  1972 


Approved  for  public  release;  distribution  unlimited. 


LEXINGTON 


MASSACHUSETTS 


The  work  reported  in  this  document  was  performed  at  Lincoln  Laboratory, 
a  center  for  research  operated  by  Massachusetts  Institute  of  Technology. 
The  work  was  sponsored  by  the  Department  of  the  Navy  under  Air  Force 
Contract  F19628-70-C-0230. 

This  report  may  be  reproduced  to  satisfy  needs  of  U.S.  Government  agencies. 


Non-Lincoln  Recipients 

PLEASE  DO  NOT  RETURN 

Permission  is  given  to  destroy  this  document 
when  it  is  no  longer  needed. 


u 


ABSTRACT 


Explicit  receiver  structures  for  the  optimum  detection  of  M-ary  orthogonal 
signals  in  impulsive  noise  environments  typical  of  ELF  are  determined. 
The  resulting  optimum  structures,  while  shown  to  bear  some  resemblance 
to  that  which  would  have  been  obtained  in  the  presence  of  Gaussian  noise 
alone,  exhibit  an  interesting  nonlinear  behavior.  The  difficulties  in  the  ac¬ 
tual  implementation  of  this  receiver  are  discussed,  and  several  definitely 
suboptimum  yet  computationally  superior  structures  are  suggested.  The 
results  are  felt  to  be  applicable  to  a  much  broader  class  of  channels  char¬ 
acterized  by  impulsive  interference. 
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OPTIMUM  DETECTION  OF  M-ARY  ORTHOGONAL  SIGNALS 
IN  ELF  NOISE  ENVIRONMENTS 


I.  INTRODUCTION 

In  previous  work/  a  model  for  ELF  noise  was  presented  which  consisted  of  the  sum  of  a 
low-density  shot  process  and  white  Gaussian  noise  (WGN).  In  the  present  report,  we  consider 
the  structure  of  optimum  receivers  for  the  detection  of  M-ary  orthogonal  signals  in  background 
noise  as  described  by  this  model.  While  it  is  not  the  intent  here  to  suggest  that  the  optimum  de¬ 
tector  structure  should  actually  be  employed  in  practice  (generally  it  is  quite  complicated),  in¬ 
stead  the  reasons  for  studying  optimum  processing  structures  are  twofold: 

(a)  To  provide  a  benchmark  against  which  the  performance  of  suboptimum  schemes 
can  be  compared, 

(b)  To  provide  insight  which  may  suggest  computationally  superior  yet  reasonably 
efficient  suboptimum  structures. 

2 

The  binary  detection  problem  has  been  considered  previously.  We  shall  treat  a  more  gen¬ 
eral  situation  where  the  channel  signaling  alphabet  consists  of  M  orthogonal  equi-energy  wave¬ 
forms.  The  approach  to  be  used  follows  closely  the  optimal  dynamical  filtering  results  associated 

3  4  5  6 

with  the  names  of  Kalman  and  Bucy,  Kushner,  '  and  Wonham. 

II.  PRELIMINARIES 


The  received  signal  is  assumed  of  the  form 
r(t)  =  s  (t )  +  n(t) 

where  s (t )  is  one  of  the  a  priori  known  signal  waveforms  s.(t),  i  =  1,2,  ...  ,M  satisfying 


s.  (t)  s.(t)  dt  =  E  6. . 
i  J  s  ij 


(1) 


(2) 


with  <5 _  the  Kronecker  delta,  and  T  the  observation  interval.  The  additive  signal  independent 
noise  n(t)  appearing  in  Eq.  (1)  is  a  sample  function  of  a  non-Gaussian  noise  process  of  the  form 


n(t)  =  y(t)  +  w(t)  ,  (3) 

whore  w(t)  is  a  zero-mean  WGN  process  with  variance  ~  N^/2  and  Nq  the  double-sided  noise 
spectral  density  in  watts/llz.  The  process  y(t),  on  the  other  hand,  is  a  low-density*  shot  process 
obtained  by  exciting  a  finite -state  linear  dynamical  system  bv  a  random  impulse  train.  In  partic¬ 
ular,  y(t)  is  described  by  the  state  equation 


x(t)  =  A  x  (t)  +  b  u(t) 


(4) 


and  the  output  relation 

y(t)  =  <c,x(t)>  ,  (5) 

where  A  is  a  constant  n  X  n  matrix,  b  and  c_  are  n-vectors  (specifically  column  vectors)  and 
<[■  ,  y  represents  an  inner  product.  The  state  vector  *L(t)  is  itself  an  n-vector  while  the  quantity 

*  By  "low-density"  we  mean  that  the  average  inter-arrival  times  between  exciting  impulses  is  greater  than  any 
system  time  constants. 
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u(t)  appearing  in  (4)  is  a  scalar  impulse  process  independent  of  w(t)  and  of  the  form 
N(t) 

u(t)  -  V  u.<5(t  —  L)  .  (6) 

i=l 

Here  (N(t),  t  >  0}  is  a  counting  process7  with  arrival  times  t^,  t^,  .  .  .  assumed  Poisson  distrib¬ 
uted  with  intensity  A  events/second,  and  {u.}  is  a  sequence  of  zero-mean  independent  and  iden¬ 
tically  distributed  (i.i.d)  random  pulse  amplitudes  with  common  univariate  probability  density 
function  (p.d.f.)  f(-).  In  Ref.  1,  two  different  amplitude  distributions  were  considered:  the  double¬ 
sided  power-Rayleigh  distribution  with  p.d.f. 


f(u) 


i  n-1 


2R 


exp 


0  <  cv  ^  2 


(7) 


and  the  double-sided  lognormal  distribution  with  p.d.f. 


f(u)  = 


exp  - 


<7  u 


ln2|u 


2(7 


(8) 


with  a2  the  variance  of  a  zero-mean  Gaussian  random  variable  g  for  which  u  =  e^. 

The  suitability  of  this  model  for  representing  ELF  noise  environments  is  discussed  in  some 
detail  in  Ref.  1  and  need  not  be  repeated.  The  significant  parameters  which  must  be  specified 
then  are: 

(a)  The  p.d.f.  f{*)  of  the  pulse  amplitudes, 

(b)  The  intensity  A  of  the  Poisson  process, 

(c)  The  linear  system  dynamics,  i.e.,  the  quantities^,  b,  and  c  in  Eqs.  (3)  and  (4), 

(d)  The  ratio  y  of  the  rms  value  of  the  shot  component  to  the  Gaussian  background 
component. 


It  is  significant  that  these  parameters  can  be  adjusted  quite  easily  to  match  observed  ELI"  noise 

recorded  at  various  geographical  locations.  Observe  that  the  choice  of  background  noise  level 
2 

a  is  somewhat  arbitrary  and  amounts  to  a  trivial  rescaling  of  the  n(t)  process. 


III.  OPTIMUM  RECEIVER  STRUCTURE 

It  will  be  assumed  that  the  receiver  bases  its  decisions  upon  the  relative  likelihood  that  a 
particular  signal  has  been  sent  and  resulted  in  the  observation  r^  rp  where,  in  general, 
rQ  £  { t(t),  0  <  t  <  t}.  In  particular,  the  decision  strategy  is  given  by:  announce  signal  i  as 
having  been  transmitted  i  =  1,  2,  .  .  .  ,  M  iff 

P  lro  t  I  i]  =  max  P(rnTljJ  >  (9> 

l<j<M 

where  p  [r^  T|j|  is  the  probability  (likelihood)  of  having  received  rQ  T  given  that  the  signal 
j  =  1,  2,  .  .  .  ,  M  had  been  transmitted.  It  will  prove  convenient  to  normalize  the  likelihoods  and 
in  fact  to  base  decisions  upon  the  logarithm*  of  the  normalized  likelihoods  so  that  the  modified 
decision  rule  becomes:  announce  signal  i  as  having  been  transmitted  i  =  1,  2,  .  .  .  ,  M  iff 

*  Actually  any  monotonic  function  suffices. 
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(10) 


where 


Mrn  T:  TJ  =  max  Mrn  t;T1 
1  U>1  l<j<M  J  U>1 


a  P  lro  t 

Ajlr0.t;t]-ln  p0[rM]  '  j  = 


(11) 


and  Pq[*]  is  the  probability  distribution  of  the  WGN  process  w(t).  Note  the  explicit  dependence 
of  the  log-likelihoods  X ^ [ - ;  t  J  on  the  time  t.  Using  some  recent  results  of  Kailath/*  the  log- 
likelihoods  assume  a  particularly  convenient  form.  More  specifically,  under  the  assumption 


dt  <  °o 


f  E{y2(t)} 

J0 

where  E{-}  represents  the  expectation  operator,  Kailath  demonstrates  that 

*j[r0  t;t]  =  j  h^(r)  d^(r)  -  y  ^2(r)  dr  ;  j  =  l,2,  ...,M 


(12) 


(13) 


where  d??(t)  =  r(t)  dt,  and  hr(t)  represents  the  least  mean-square  estimate  of  the  signal  plus  shot 
noise  process  hr(t)  =  sj(t)  +  y(t).  In  particular,  due  to  the  additivity  of  the  expectation  operator, 


hj(t)  =  Sj  (t)  +  <c,x(t|j)> 


(14) 


where 


x(t|j)  =  E{x(t)|r0  t,  j}  ;  j  =  1,  2, 


,  M 


(15) 


is  the  least  mean-square  estimate  of  the  state  vector  x  (t)  given  that  signal  j  has  been  transmit¬ 
ted  and  has  resulted  in  the  observation  rQ  t  =  {r(r),  0  r  ^  t}.  The  first  integral  in  (13)  must 
be  interpreted  with  care  as  a  stochastic  integral  —  an  Ito  integral  to  be  precise.  Nevertheless, 
the  practical  interpretation  of  this  integral  is  as  a  cross-correlation  operation  between  the  re¬ 
ceived  process  and  the  least  mean-square  estimate  of  the  signal  plus  shot  noise  component.  The 
resulting  receiver  structure  is  illustrated  in  Fig.  1  which  would  be  identical  to  the  optimum  de¬ 
tector  for  known  signals  in  Gaussian  noise  were  it  not  for  the  presence  of  the  estimators  required 

to  recover  the  {h.(t)j  from  the  received  data.  Indeed,  from  (14)  if  there  were  no  shot  noise  pres- 
a  J 

ent,  hr(t)  =  Sj(t)  independent  of  the  received  data  and  the  detector  reduces  to  the  familiar  correla¬ 
tor  or  matched  filter  receiver.  The  presence  of  the  shot  noise  component,  on  the  other  hand, 
requires  explicit  computation  of  the  estimates  {lr(t)},  or  equivalently,  (x(t|j)},  and  complicates 
the  receiver  implementation  considerably.  The  remainder  of  this  report  will  be  concerned  with 
the  development  of  computational  algorithms  for  generating  the  least  mean-square  estimates 
{x  (t  |j)}  from  the  noisy  received  data. 

Let  us  first  recall  from  the  definition  of  x  (t  |  j )  the  explicit  assumption  that  the  j**1  signal  was 

transmitted  and  resulted  in  the  observation  rj  .,  i.e.,  from  (15) 

l0,r 


£(t|j)  =  \  n  xp{x;t|r  t,j}  dx 


=  1.  2. 


M 


(16) 


*  The  fact  that  we  are  using  the  variable  X  to  designate  the  intensity  of  the  Poisson  point  process  and  { X j }  to  des¬ 
ignate  log -likelihoods  should  cause  no  confusion. 

|  Note  that  we  have  explicitly  assumed  tQ  =  0  above. 
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ESTIMATOR/CORRELATOR  BRANCH 


Fig.  1 .  Optimum  detector  structure. 
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Fig.  2.  Generation  of  least-squares  estimates. 
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where  the  conditional  probability  density  function  p{x;t|r  j}  represents  the  probability  of 

observing  the  state  x  (t)  =  x  at  time  t  given  that  the  j  n  signal  was  transmitted  and  has  resulted 
in  the  observation  r.  +  .  The  integration  in  (1 6)  is  over  the  n-dimensional  Euclidean  space  Rn 

V 

with  volume  element  dx.  Note  that  we  again  explicitly  include  the  time  t  in  the  conditional  prob¬ 
ability  expression  to  emphasize  its  dependence  upon  time.  It  follows  that 


p{x;t|r  t,j}  =  p{x;t|v.(T),  t0<  t<  t}  ;  j  =  1, 2,  ....  M 
tfV  J 


(17) 


.th 


where  v^(t)  =  r(t)  ”  s^(t)  since,  given  that  the  j  signal  was  transmitted  and  resulted  in  the  ob¬ 
servation  r  ,,  it  is  only  this  difference  representing  the  additive  noise  process  n(t)  which  is 
t0,t 

relevant  to  the  determination  or  estimation  of  the  state  vector  x(t).  Thus,  the  estimators  x.(t  |j), 

A 

or  equivalently  h.(t),  j  =  1,  2,  .  .  .  ,M  can  be  implemented  as  illustrated  in  Fig.  2.  Here  the  quan- 

A 

tity  Xq(^)  represents  the  least  mean-square  estimator  of  the  state  vector  x(t)  given  observations 
on  the  additive  noise  process  n(t)  in  the  interval  [tQ,t],  i.e., 


xQ(t)  =  E{x  (t)  |n^  J 


(18) 


Observe  that  the  input  to  the  estimator  £0(t)  in  Fig.  2  is  actually  v^(t)  which  will  be  equal  to  n(t) 
only  if  the  signal  was  in  fact  transmitted.  If,  instead,  the  i**1  signal  was  actually  transmit¬ 
ted,  then  Vj(t)  contains  in  addition  to  the  noise  process  n(t)  a  bias  term  equal  to  the  difference 
between  the  i**1  and  j**1  signals.  It  will  be  convenient  then  to  let  v(t)  =  n(t)  represent  a  generic 
input  to  the  estimator  in  Fig.  2  which  will  equal  v^(t)  in  the  j**1  estimator  branch.  Similarly,  we 

will  drop  subscripts  in  describing  the  estimator  x  Q(t)  given  by  (18)  and  proceed  then  to  evaluate 


x  (t)  = 


(19) 


or  more  precisely  the  conditional  probability  density  p{x;t|v  ,}.  In  the  following  section  we 

V 

will  derive  a  modified  Fokker-Planck  equation  which  governs  the  time  evolution  of  this  condi¬ 
tional  density.  In  particular,  it  will  be  shown  that 


9p{x;t|vt  t} 

— -  =  L+p{x;t|vt  t}  +  ao_2p{x;t|vt  t) 


at 


X  [v(t) -<c  ,x  (t)> ]  [  <c  ,x(t)>  -<£,x(t)>]  ,  (20) 

where  L+{*}  is  the  forward  differential  generator  of  the  shot  process  given  in  this  case  by 

/3Ax(t)  {-}\ 

L+{.}  ='Tr(^^ - j  -A(t)  E[Au{-}]  ,  (21) 

where  Tr  represents  the  trace  of  a  matrix,  is  the  n  X  n  matrix  described  in  (4)  and  the  sequel, 
A (t)  is  the  intensity*  of  the  exciting  Poisson  point  process.  The  expectation  in  (21)  is  with  re¬ 
spect  to  the  pulse  amplitude  distribution  with  generic  random  variable  u,  and  finally  ^u{*}  is 
the  difference  operator  defined  for  any  continuous  functional  on  Rn  by 

Au{g(x)}  =  g(x)-g(2L'bu)  ,  (22) 

*  Note  that  we  are  allowing  the  intensity  to  be  possibly  a  function  of  time. 
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where  b  has  been  defined  in  connection  with  (4).  Equation  (20)  describes  the  complete  time 

evolution,  subject  of  course  to  appropriate  boundary  conditions,  of  the  conditional  density 

p{x_;t|vf  ,}  and  hence  the  h.(t),  j  =  1 , 2,  .  .  . ,  M  from  Fig.  2.  The  operator  portion  L+{*}  repre- 
t0’  J 

sents  the  Fokker-Planck  equation  for  the  shot  process  alone  while  the  second  term  in  (20)  is  due 

9 

to  the  effect  of  noisy  observations  on  the  state  x  (t).  This  expression  is  quite  similar  to  that 
which  would  have  been  obtained  if  the  point  process  u(t)  were  WGN  except  for  different  interpre¬ 
tations  of  the  operator  L+{-}.  The  expression  given  by  (20)  can  be  used  to  obtain  the  explicit 

a  a 

structure  of  the  estimator  x.  (t)  and  hence  the  hj(t),  j  =  1,  2,  .  .  .  ,  M. 

IV.  DERIVATION  OF  MODIFIED  FOKKER-PLANCK  EQUATION 

Of  the  various  approaches  to  the  derivation  of  the  modified  Fokker-Planck  equation  given 

4  5 

by  (20),  the  approach  taken  here  will  follow  where  possible  one  first  given  by  Kushner,  *  which 

9  10 

has  by  now  become  fairly  standard.  '  The  derivation  proceeds  in  two  steps.  First,  we  assume 

the  conditional  probability  density  p{x;t|v,  }  is  given  and  proceed  to  determine  the  effect  on 

l0'1 

this  quantity  of  an  incremental  change  in  the  conditioning  or  observation  process  v(t).  Then  the 
effects  of  an  incremental  change  in  the  state  x  (t)  are  considered  leading  finally  to  the  modified 
Fokker-Planck  equation  described  by  (20).  It  will  prove  convenient  at  this  time  to  replace  the 
conditioning  upon  the  v(t)  process  by  conditioning  upon  the  process  z(t),  defined  by  dz(t)  =  v(t)  dt, 
which  is  possible  due  to  the  one-to-one  relationship  between  them.  In  particular,  we  have 

p{x;t|vt^t+dt}  =  p{x;t|z  ,dz(t)}  (23) 


which  allows  us  to  proceed  with  Step  1  of  the  derivation. 

Step  1 :  It  follows  from  application  of  Bayes  rule  that 

p{ dz(t)  |  z  x(t)>  p{x;t|z  } 
Lrvt  Ln»L 


p{x;  1 1  z  ,,  dz(t)} 
l0' 


p{dz(t)|z  } 
l0'1 


p{dz(t)|x(t)}  p{x;t  |z  } 

Lry 1 

=  7 - B -  ,  (24) 

J  p{dz(t)  |x  (t)}  p{x;t|z  }  dx 
R  l0'1 

where  the  integration  is  over  the  n-dimensional  real  Euclidean  space  Rn  and  we  have  made  use 
of  the  fact  that 

p{dz(t)|z  f,x(t)}  =  p{dz(t)  |x  (t)}  (25) 

V 

since  conditioning  on  z .  ,  given  x  (t)  can  give  no  new  information  about  dz(t). 

t0,t 


Since 

dz(t)  =  n(t)  dt  =  [<c,x(t)>  +  w(t ) ]  dt  ,  (26) 

it  follows  that  given  x.(t),  dz(t)  is  Gaussian  with  mean 

E{dz(t)|x(t)}  =<c,x(t)>  dt  (27) 
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and  variance 


var{dz(t) \x  (t)}  =  vq  dt  , 


where  is  the  variance  of  the  WGN  process  w(t)  as  described  previously.  Thus, 
p{dz{t)  jjt  (t)}  =  1  —  exp  | - i — |dz(t)  -  <£,  x  U)>  dt)2  J 

J7^7t  '  2co dt  I 


(28) 


(29) 


Now  defining 


y  (dz(t),  dt;  t  J  = 


P{^'tlZt0,t+dt> 

p{x;t|z  } 


(30) 


it  follows  from  (24)  together  with  (29)  after  canceling  common  factors  in  both  the  numerator  and 
denominator  that 


y  [dz(t),  dt;  t]  = 


exp{<ro^  dz(t)  ,  x(t)>  —  0-5aQ  2  <9. ,x(t)>^  dt} 

/  n  exp{co2dz(t)  <c  ,x>—  0.5o-o-2  <c,_x>2  dt}  p{x;t|zt  t}  dx 
R,  O' 


(31) 


Now  expanding  y  [dz(t),dt;t]  in  a  Taylor  series  about  dz(t)  =  dt  =  0  it  can  be  shown  by  fairly 
standard  arguments^'* °  that  up  to  terms*  o(dt) 

y  [dz(t),  dt;  t]  =  1  +  [dz(t)  -  <c  ,  x_(t)>  dt]  •  [<c  ,  x  (t)>  -  <c  ,  x  (t)>] 


(32) 


where  jc(t)  is  the  conditional  or  minimum  mean-squared  estimate  given  by  (18)  or  equivalently 
(19).  Substituting  this  result  into  the  defining  equation  (30)  for  y  [dz(t),  dt;  t],  we  have 


p{x ;  1 1 ztn>t+dt^  =  P<*;tK0,t}  +  dq  !^;tJ 


(33) 


where 


dq  [x  ;  t  ]  = 


p{x;t|z  } 
Ln*L 


[dz(t)  -  <c  ,£(t)>  dt]  •  [<c  ,  x(t)>  -  <c  ,  x  (t)>  ]  .  (34) 


The  expression  given  by  (33)  then  represents  the  incremental  change  in  the  conditional  density 
p{x;t|z  .}  (or  equivalently  p{.x;t|vf  t})  due  to  an  incremental  change  in  the  observation  alone. 

V  l0’1 

The  derivation  of  the  modified  Fokker-Planck  equation  is  completed  in  Step  2  by  determining  the 
effect  upon  p{x;t|z  }  of  an  incremental  change  in  the  value  of  the  state  vector  x(t). 

V 


Step  2:  It  is  here  that  we  depart  from  the  Kushner  derivation  which  depends  heavily  upon 
the  higher  order  nature  in  dt  of  the  higher  order  moments  of  dx(t).  Such  is  not  the  case  for  the 
low-density  shot  processes  being  considered  here.  As  an  alternative,  we  present  an  approach 
suggested  by  Snyder.2  In  particular,  observe  that 

p{x;t  +  dtlz^  ^}  dx  =  Pr|x(t  +  dt)  e  1^ |ztQ>t+dt|  .  (35) 

*  By  the  notation  o(dt)  we  mean  a  quantity  such  that  lim  [o(dt)/dt]  =  0. 

dt-0 
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where  I^1  is  a  region  of  volume  dx  centered  at  the  point  x.  in  Rn.  Let  us  assume  that  dt  is  so 
small  that  the  probability  of  more  than  one  impulse  occurring  in  the  interval  (t,  t  +  dt)  is  o(dt). 
Then  up  to  higher  order  terms  in  dt  we  have 

Pr  |x  (t  +  dt)  €  i"  |z  t+dtJ  =  [1  -  A(t)  dt]  Pr  jx  (t  +  dt)  e  I  ”  |z^  t+dt’  dN*t*  =  °| 


+  X(t)  dt  Pr|x(t  +  dt)  c  I^|z  +dt,  dN(t)  =  l|  ,  (36) 

where  as  before  A (t )  is  the  intensity  of  the  driving  Poisson  point  process*  and  dN(t)  is  the  num¬ 
ber  of  impulses  which  have  occurred  in  the  infinitesimal  interval  (t,  t  +  dt).  Observe  that  if 
dM(t)  =  0, 

x  (t  +  dt)  =  $(dt)  x  (t)  ,  (37) 


where  Jj(t)  =  exp  {At}  is  the  state  transition  matrix  associated  with  the  linear  system  generating 

1 1 

the  shot  process.  Since  $(r)  is  nonsingular 

Pr|x(t  +  dt)  £  I^|zto>t+dt.  dN(t)  =  0 J  =  Pr|x(t)  c  £ '(dt)  I>t()ft+dt|  ,  (38) 

where  by  the  symbolic  notation  $  d  (dt)  In  we  mean  the  set  {4  e  Rn:4>(dt)  4  e  I  n} .  Similarly,  if 
dN(t)  =  1,  ~  ~ 


r  t+dt 

x_(t  +  dt)  =  4> ( dt )  x  (t)  +  \  4>(t  +  dt  -  a)  bu(a)  da 

Jt 

=  ( dt )  x(t)  +  u<I>(t  +  dt  —  r)  b  ,  (39) 


where  r  is  the  time  of  occurrence  of  the  single  impulse  with  t  <  r  <  t  +  dt,  and  u  is  a  generic 
random  variable  with  probability  density^  f(-)  representing  the  random  amplitude  of  the  exciting 
pulse  train.  Since 

2 

|(T)  =  MrA  +  y  A  +...  .  (40) 

with  l  the  n  X  n  identity  matrix,  it  follows  that  it  is  possible  to  write  (39)  as 

x_  (t  +  dt)  =  $(dt)  x_  (t)  +  b  u  +  O(dt)  (41) 

so  that  finally  after  retaining  terms  up  to^  O(dt) 

Pr|x(t  +  dt)  e  I^|zto>t+dt.  dN(t)  =  l) 


/  *°°  \  \ 

=  J  Pr{x(t)  «  |  (dt)  [1^  -  bu]|z  t+dt[  f(u)  du  ,  (42) 

where  again  by  $  1  (dt)  [I^1  -bu]  we  mean  the  set  {j_  e  Fln :  (dt)  ^  +  t)  u  e  I^1} .  Thus  up  to  higher 


*  Observe  that  the  driving  point  process  need  not  be  Poisson  for  the  development  to  follow. 

t  We  will  restrict  attention  to  f(*)  described  by  either  (7)  or  (8). 

t  By  the  notation  O(dt)  we  mean  a  quantity  such  that  lim  [0(dt)/dt]  <  oo. 

dt-0 
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order  terms  in  dt  we  have  obtained 


p{x  ;  t  +  dt 


zt0>t+dt>  dx  =  H  -A(t)  dt]  Pr|x(t)  £  ♦"‘(dt)  £|zto>udt| 

+  A (t)  dt  V  Pr  |x(t)  £  *"  (dt)  [i"  -  bu]|zt  t+dtjf(u)du 


(43) 


from  which  it  follows  that 


p{x;t  +  dtlztQ  t+dt}  =  t1  ”  x(t)  dtJ  det  1£(dt>  P{£_1(dt)  iltlzt0>t+dt} 

+  A(t)  dt  det_1£(dt)  j  p{£_1(dt)  (x  -  b  u];  t  |zt  t+dJ  f(u)  du 


(44) 


where  det#(dt)  represents  the  Jacobian  of  the  transformation  relating  x  (t  +  dt)  to  x(t).  At  this 
point  some  approximations  are  in  order.  By  expanding  the  first  conditional  probability  on  the 
right-hand  side  of  (44)  in  a  Taylor  series  about  the  point  x,  we  have 


p(f _1  (dt>  £ ;  *  I  zt  t +dt}  =  ;  * 1  ztn,t+dt>  " 


3P{x;t|zt  +dt> 


a  x 


dt  +  o(dt) 


L0* -  O' 

where  the  superscript  T  denotes  transpose.  Similarly,  we  have 

P{£_1  (dt)  lx-bu];t|zt  >t+dt}  =  p{x  -  bu;t|zt  >t+dt}  +  O(dt) 


(45) 


(46) 


where  O(dt)  represents  a  quantity  such  that  0(dt)/dt  —  M  a  nonzero  constant  as  dt  —  0.  Finally, 
after  observing  that 

det_1^(dt)  =  1  -  Tr{A}  dt  +  o(dt)  ,  (47) 

it  follows  that  upon  substituting  these  results  into  (44)  and  grouping  together  terms  in  dt 
p{x;t  +  dt  I zto>t+dt}  =  f1  _  tA(t)  +  Tr4J  dt)  P{*;t|zt  t+dJ 

>T  9 


-{Ax}  gY  P(2->'tlzto,t+dJ 


dt 


+  A  (t)  dt 


J^Ptx-bujtlz^^} 


f(u)  du  +  o(dt) 


which  is  more  compactly  written  as 

p{x;t  +  dt|zto  t+dt}  =  p{x;t|zto  t+dt}  -Tr(g^Axp{x;t|zto  t+dt})  dt 

r°° 

—  A (t)  dt  j  [p{x;t|zto  t+dt}  -  p{x -bu;t|zt^>t+dt}J  f(u)  du  +  o(dt) 
Finally,  defining 

dP(x  ;  1 1  z  }  A  p{x;t  +  dt|z  t  ..}  -  p{x  ;  1 1  z  }  , 

l0’  l0’  al  tQ,t 


(48) 


(49) 


(50) 
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we  have  upon  neglecting  higher  order  terms  in  dt 

dpfX  ;  1 1  =  dq  [x  ;  t]  -  Tr  A  x  p{x  ;  1 1  z^^dt 

*00 

"x(t)  dti.M  N-;t|zt0,t+dt> "p^"^u;tlzt0,t+dt>J f(u)  du  •  <51> 

At  this  point  it  is  possible  to  divide  both  sides  of  this  last  expression  by  dt  and  pass  to  the  limit 
as  dt  —  0  with  the  result 

3p{x;t|vt  t} 

- =  L+P^;tlvt0,t>  +ao2p^:tlVt0,t> 

X  [v(t) -<c,x(t)>]  [<c,x(t)> -<c,x(t)>]  ,  (52) 

which  is  the  desired  result  [Eq.  (20)].  The  operator  L+  which  represents  the  forward  differential 
generator  of  the  shot  process  has  been  defined  previously  in  (21)  and  the  sequel. 

V.  LEAST-SQUARE  ESTIMATOR  EQUATIONS 

Differential  equations  for  the  least-squares  estimator  x  (t)  given  by  (19)  can  be  readily  ob¬ 
tained  from  the  results  of  the  preceding  section.  In  particular,  if  we  multiply  both  sides  of  the 
modified  Fokker-Planck  equation  (51)  by  x  and  integrate  over  Rn  we  obtain,  as  shown  in  Appen¬ 
dix  A, 

x(t)  =  Ax(t)  +mubA(t)  +aQ2K(t)  c{v(t)  -(c,x(t))}  .  (53) 

Here  the  gain  matrix  K  (t)  is  given  by  the  variance  of  the  estimation  error 

K  (t)  =  E  J  [x  (t)  -x(t)]  [x(t)  -  x(t)]T|vt  t|  ,  (54) 

and  mu  =  E{u}  is  the  mean  of  the  pulse  amplitudes  -  actually  zero  for  the  distributions  described 
by  Eqs.  (7)  and  (8).  Figure  3  illustrates  the  implementation  of  the  estimator  x  (t)  under  the 
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assumption  m  =0.  The  structure  is  exactly  that  of  the  Kalman-Bucy  filter  operating  on  Gauss- 
Markov  data,  the  only  difference  being  in  the  way  the  gain  matrix  j£(t),  or  equivalently  the  error 
covariance  matrix,  is  obtained.  As  in  the  Kalman-Bucy  theory,  a  differential  equation  for  the 
error  covariance  matrix  can  be  obtained  from  the  results  of  the  preceding  section.  In  Appen¬ 
dix  A  it  is  shown  that  the  error  covariance  matrix  K  (t)  satisfies  the  matrix  differential  equation 

K  (t)  =  A  K  (t)  +  K  (t)  AT  -  Aft)  [cr^  +  m^]  B  -  a^Kft)  CK(t) 

+  ao"2  [vft)  -<c  ,£(«>]  e| [x  (t)  -  £(t)J  [x  ft)  -x(t)]T 

X  [x  (t)  -  x  (t)]Tc  |v  I  .  (55) 

l0’  1 

T  T  2 

Here  B  =  bb  ,  C  =  cc  ,  and  is  the  variance  of  the  pulse  amplitudes  which  excite  the  linear 
dynamical  system  to  obtain  the  shot  process.  Observe  that  if  the  Poisson  point  process  u(t)  were 
replaced  by  a  WGN  process  the  last  term  in  (55)  vanishes  since  odd  moments  (in  this  case  the 
third)  of  Gaussian  zero-mean  variates  are  identically  zero.  In  the  case  we  have  treated,  these 
odd  moments  need  not  vanish  and  the  gain  matrix  K  (t)  depends  explicitly  upon  both  the  input  data 
and  the  current  estimate  x  (t)  through  this  last  term.  This  results  in  a  nonlinear  receiver  struc¬ 
ture  and  considerably  complicates  the  receiver  implementation  since  now  the  gain  cannot  be  de¬ 
termined  a  priori  before  observations  are  taken.  Indeed,  determination  of  K(t)  from  (55)  requires 
evaluation  of  third  moments  of  the  estimation  error.  It  can  be  shown  that  differential  equations 

can  be  obtained  for  the  third  and  higher  order  moments  which  will  be  found  to  depend  upon  still 

2 

higher  order  moments  of  the  estimation  error  as  well  as  the  noisy  observations.  Snyder  has 
suggested  that  truncation  of  this  dependency  on  higher  order  moments  at  some  point  will  result 
in  a  reasonable  suboptimum  detector  structure.  Such  questions  are  objects  of  continuing 
investigations. 

VI.  SUMMARY  AND  CONCLUSIONS 

We  have  explicitly  defined  the  optimum  detector  structure  for  reception  of  M-ary  orthogonal 
signals  in  ELF  noise  environments.  This  structure  has  been  shown  to  bear  some  resemblence 
to  that  which  would  have  been  obtained  in  the  presence  of  Gaussian  noise  alone.  The  presence 
of  the  low-density  shot  component  has  been  shown  to  severely  complicate  the  receiver  implemen¬ 
tation.  Studies  in  progress  will  be  concerned  with  the  implementation  and  performance  of  sev¬ 
eral  suboptimum  versions  of  this  structure  obtained  as  described  in  the  preceding  section  by 
truncating  the  dependency  of  the  gain  matrix  on  higher  order  moments  at  some  level.  Also  under 
study  are  techniques  for  obtaining  exponential  error  bounds  on  error  probability  performance 
over  the  ELF  channel  and  adaptive  identification  techniques  for  determining  the  linear  system 
dynamics  and  the  time-varying  intensity  of  the  Poisson  point  process  from  observations.  Another 
area  which  has  been  under  continuing  study  is  that  of  the  evaluation  of  the  relative  performance 
of  various  suboptimum  nonlinear  detector  structures  of  the  limiting  variety.  These  results  will 
be  described  in  a  later  report. 


11 


APPENDIX  A 

DERIVATION  OF  LEAST-SQUARES  ESTIMATOR  EQUATIONS 

Here  we  desire  to  develop  a  differential  equation  for  the  least-squares  estimator  x  (t)  given 
by  (19).  First  define  the  quantity 

dx(t)  =x(t  +  dt)  —  x(t) 


=  \  xdP<x;1:|z  t)  dx 
JRn 


(A-l ) 


so  that  from  (51) 


dx(t)  =  xdq  [x ;  t]  dx  -  dt  xTr  Ax  p{x  ;  1 1  z^t}j  dx 

-A(t)dtj  n2i{J  [p{x;t|zto  t}  “  P{x-  bu;t|zt^  J  ]  f(u)  duj  dx_ 

Now  evaluating  each  of  these  integrals  separately  it  follows  first  from  (34)  that 

f  xdq  [x  ;t]  dx  =  cr^K(t)  c{dz(t)  -  <c  ,x  (t)>  dt} 

where  I£(t)  is  the  covariance  of  the  estimation  error  given  by 

K.(t)  =  \  _  [x-£(t)J  [x-£(t)]T  p{x.;t|z  .}  dx. 

JR  V1 

=  E|[x(t)  -x(t)]  [x(t)  -£(t)]T|zt  t| 


(A-2) 


(A-3) 


(A-4) 


.th 


Similarly,  if  we  let  {Ax}-  denote  the  i  component  i  =  1,  2,  .  ,  . ,  n  of  Ax,  then  under  the  assumption 


p{x;t|z  A  =  o  ;  i  =  1, 2, ...  n 

1  1  Ln,L 


whenever  x.  =  ±  «>,  it  follows  after  integration  by  parts  that 


3r»  iTr  te  1  v>) di ' ' 


(t) 


(A-5) 


(A-6) 


Finally,  after  some  algebra  it  can  be  shown  that  the  last  integral  in  (A-2)  can  be  expressed  as 

j  nxjj  [p{x  J 1 1  zt  t)  -P(x -bu;t|zt^  t]|  f(u)  duj  dx  =  -bm^  ,  (A-7) 

where  =  E(u)  is  the  mean  value  of  the  exciting  pulse  amplitudes.  Thus,  the  expression 
(A-l)  for  dx(t)  becomes 
,  a 


dx(t)  =  A  x  (t)  dt  +  m^  bX(t)  dt  +  (t)  c{dz(t)  —  <c  ,  x  (t)>  dt} 


(A-8) 


so  that,  after  dividing  both  sides  by  dt  and  making  use  of  the  fact  that  dz(t)  =  v(t)  dt,  we  arrive 
at  Eq.  (53)  of  the  text. 
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APPENDIX  B 

ERROR  COVARIANCE  EQUATION 


Bet  us  first  observe  the  vector-matrix  identity 

[x  —  £  (t)  ]  [x  —  £ (t)]T  =  [x  —  x  (t  +  dt ) ]  [x  —  x  (t  +  dt)]^  +  dx  (t)  d£^(t) 

+  [x  -  x  (t  +  dt)]  d (t)  +  d£  (t)  [x  —  x  (t  +  dt)]"1"  ,  (B-l ) 

where  dx(t)  =  £(t  +  dt)  -  x  (t).  Now  multiplying  the  right-hand  side  of  this  last  expression  by 

the  left-hand  side  of  (51),  and  the  left-hand  side  by  the  right-hand  side  of  (51),  we  obtain  (see 

Ref.  9  for  similar  results),  after  integrating  over  Rn, 

dK(t)  +  d£(t)  dxT (t)  =  -  dt  j  n  [x  -x.(t)]  [x  -  x  (t)]TTr  A  xp{x  ;  1 1  t+dt}j  dx 

-  Mt)  dt  [x  -  x  (t)]  [x  -  x  (t)]T  E  [AuP(x.;  t  |zt^>t+dt}]  dx 

+  a  ”2  [dz(t)  -  <c  ,  £(t)>  dt]  (  [x  -  £(t)]  (x  -  x  (t)]T  [<c  ,x  (t)> 
o  jRn 

-<c  ,x(t)>]  p{x;t|z  ,}dx  ,  (B-2) 

0’ 

where 

K(t)  =  i  |x -x  (t)]  [x -x(t)]Tp{x;t|z  ]  dx 

—  JRn  iQ,i 

E{[X  -.x(t)]  [x  -  x  (t)]T  |z  }  ,  (B-3) 

l0’ 

and  dK(t)  =  KJt  +  dt)  —  JC  (t).  Evaluating  the  first  integral  on  the  right-hand  side  of  (B-2)  by  parts, 
again  under  similar  assumptions  as  stated  in  (A -5),  it  follows  that 

j  n  [x  -  X  (t)]  [x -£(t)]T  Axp{x  ;  t  |zt  dx  =  -[AK(t)  +  K(t)AT]  .  (B-4) 

Similarly,  it  can  be  shown  easily  that 

^  n  [x -x(t)]  [x -x(t)]TE  [Aup{x;tlzto>t+dt}j  dx  =  [(*„  +  1  .  (B-5) 

where  a  =  var{u}  is  the  variance  of  the  pulse  amplitudes  represented  by  the  generic  random 

U  rj> 

variable  u  and  B  =  b  b  .  Thus  it  follows 

djx  (t)  +  dx(t)  dx^(t)  =  [A  K_(t)  +  K_(t)  A^]  dt  —  A(t)  dt  +  m^ )  13 

+  a  "2  [ dz(t)  -<c  ,x. (t)>  dt J  e|[x  -  $(t)j  [ x  x  (t) ]T  [x  -  £ (t)]Tc  |z  I  .  (B-6) 

I  t  I 

Observe  from  (A-8)  that  after  dropping  obvious  terms  of  higher  order  in  dt 

d£(t)  dxT(t)  =  <j~4  [dz(t) -<c  ,  x>dt]2  K(t)  CK  (t)  ,  (B-7) 
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(B-8) 


T 

where  C  =  cc  .  Furthermore,  since 

dz(t)  =  <c  ,x(t)>  dt  +  d£(t) 

where  d£(t)  =  w(t)  dt  and  £(t)  is  a  Brownian  motion  process,  we  have  up  to  terms  o(dt) 
dx(t)  dxT(t)  =  <rj4d2$(t)  K  (t)  CK  (t) 

=  cro"2  dtK(t)  CK  (t)  ,  (B-9) 

9  2 

where  we  have  made  use  of  the  fact  that  the  process  d  £  (t)  is  essentially  deterministic  and 
2 

equal  to  dt. 

Finally,  we  obtain 

dK(t)  =  [A  K  (t)  +  K(t)  AT]  dt  -  A(t)  dt  [a  2  +  m^]  B -ctJ2  dtK(t)  C  K  (t) 

+  crj2  [dz(t)  -  <c  ,  x  (t)>  dt]  e|  [x  -  x  (t) ]  [x  -  x  (t)]T  [x  -  x  (t) ]Tc  |  tj  .  (B-10) 

whereupon,  dividing  both  sides  of  this  last  expression  by  dt  while  passing  to  the  limit  as  dt  —  0 
and  making  use  of  the  fact  dz(t)  =  v(t)  dt,  we  obtain  Eq.  (55)  of  the  text. 
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